AMENDMENTS TO THE CLAIMS 



Please amend the claims as follows: 

1. (Currently amended) A computer-implemented method of text equivalencing
from a query string of characters comprising:

modifying the query string of characters using a predetermined set of heuris- 
tics; 

performing a character-by-character comparison of the modified query string 
with at least one known string of characters in a corpus in order to locate a
match; and 

responsive to not finding an exact match, performing the steps of: 

forming a plurality of sub-strings of characters from the query string [[of
characters]], the sub-strings having varying lengths; and

using an information retrieval technique on the sub-strings formed from
the query string of characters to identify [[determine]] a known string of
characters equivalent to the query string [[of characters]].

2. (Currently amended) The method of claim 1, wherein the information re- 
trieval technique further comprises: 

weighting the sub-strings; 

scoring [[the]] known strings of characters; and

retrieving information associated with [[the]] a known string of characters [[with]]
having the highest score.

3. (Currently amended) The method of claim 2, further comprising, responsive
to the highest score being greater than a first threshold, automatically accepting
the known string of characters having the highest score as an exact match. 

4. (Currently amended) The method of claim 2, further comprising, respon- 
sive to the highest score being less than a second threshold and greater than a first 
threshold, presenting the known string of characters having the highest score to a 
user for manual confirmation. 

5. (Currently amended) The method of claim 2, further comprising, respon- 
sive to the highest score being less than a second threshold and greater than a third 
threshold, presenting the known string of characters having the highest score to a 
user to select the equivalent string of characters. 

6. (Currently amended) The method of claim 1, wherein [[the sub-strings of
characters are 3-grams]] forming a plurality of sub-strings of characters comprises
successively extending sub-strings based on frequency of occurrence in the modified
query string.

7. (Currently amended) The method of claim 1, wherein the query string [[of
characters]] is selected from the group consisting of a song title, a song artist, an album
name, a book title, an author's name, a book publisher, a genetic sequence, and a
computer program.

8. (Currently amended) The method of claim 1, wherein the predetermined
set of heuristics comprises removing whitespace from the query string [[of characters]].

9. (Currently amended) The method of claim 1, wherein the predetermined 
set of heuristics comprises removing a portion of the query string [[of characters]].

10. (Currently amended) The method of claim 1, wherein the predetermined 
set of heuristics comprises replacing a symbol in the query string of characters with 
an alternate representation for the symbol. 

11. (Currently amended) The method of claim 1 further comprising storing a
database entry indicating [[an indication]] that the query string [[of characters]] is [[the]] an
equivalent of the identified known string [[of characters]].

12. (Currently amended) A computer implemented system for text equiva- 
lencing from a query string of characters comprising: 

a heuristics module for modifying the query string of characters using a pre- 
determined set of heuristics; 

a comparator module, coupled to the heuristics module, for performing a 
character-by-character comparison of the modified query string with at 
least one known string of characters in a corpus in order to find a match; 

a sub-string formation module, coupled to the comparator module, for,
responsive to not finding an exact match, [[for]] forming a plurality of
sub-strings of characters from the query string [[of characters]], the sub-strings
having varying lengths; and

an information retrieval module, coupled to the sub-string formation module,
for performing an information retrieval technique on the sub-strings
formed from the query string of characters to identify [[determine]] a known
string of characters equivalent to the query string [[of characters]].

13. (Currently amended) The system of claim 12, wherein the information re- 
trieval module further comprises: 

a weight module for weighting the sub-strings; 
a score module for scoring [[the]] known strings of characters; and
a retrieval module, coupled to the weight and score modules, for retrieving
information associated with the known string of characters [[with]] having the
highest score.

14. (Original) The system of claim 13, further comprising an accept module, 
coupled to the retrieval module, for accepting the information retrieved as an exact 
match for the highest score greater than a first threshold. 

15. (Original) The system of claim 13, further comprising an accept module, 
coupled to the retrieval module, for presenting the information retrieved to a user for 
manual confirmation for the highest score less than a first threshold and greater than 
a second threshold.

16. (Original) The system of claim 13, further comprising an accept module,
coupled to the retrieval module, for presenting the information retrieved to the user 
as a set of options for a user to select for the highest score less than a second thresh- 
old and greater than a third threshold. 

17. (Currently amended) The system of claim 12, wherein [[the sub-strings of
characters are 3-grams]] the sub-string formation module forms a plurality of
sub-strings of characters by successively extending sub-strings based on frequency of
occurrence in the modified query string.

18. (Currently amended) The system of claim 12, wherein the query string [[of
characters]] is selected from the group consisting of a song title, a song artist, an album
name, a book title, an author's name, a book publisher, a genetic sequence, and a
computer program.

19. (Currently amended) The system of claim 12, wherein the predetermined 
set of heuristics comprises removing whitespace from the query string [[of characters]].

20. (Currently amended) The system of claim 12, wherein the heuristics
module comprises a removal module for removing a portion of the query string [[of
characters]].

21. (Currently amended) The system of claim 12, wherein the heuristics
module comprises a replacement module for replacing a symbol in the query string 
of characters with an alternate representation for the symbol. 

22. (Currently amended) The system of claim 12 further comprising a database
update module for storing a database entry indicating [[an indication]] that the
query string [[of characters]] is [[the]] an equivalent of the identified known string [[of
characters]].

23. (Currently amended) A computer-readable medium comprising com- 
puter-readable code for performing text equivalencing from a query string of charac- 
ters comprising: 

computer-readable code adapted to modify the query string of characters us- 
ing a predetermined set of heuristics; 

computer-readable code adapted to perform a character-by-character com- 
parison of the modified query string with at least one known string of
characters in a corpus in order to locate a match; and 

computer-readable code adapted to, responsive to not finding an exact match[[,]]
[[adapted to]]:

form a plurality sub-strings of characters from the query string [[of
characters]], the sub-strings having varying lengths; and to

[[computer-readable code adapted to]] use an information retrieval technique
on the sub-strings formed from the query string of characters to
identify [[determine]] a known string of characters equivalent to the query
string [[of characters]].



24. (Currently amended) The computer-readable medium of claim 23, 
wherein the information retrieval technique further comprises: 

computer-readable code adapted to weight the sub-strings; 
computer-readable code adapted to score the known strings of characters; and 
computer-readable code adapted to retrieve information associated with [[the]] a
known string of characters [[with]] having the highest score.

25. (Currently amended) The computer-readable medium of claim 24, further 
comprising computer-readable code, responsive to the highest score being greater 
than a first threshold, adapted to automatically accept the known string of characters 
having the highest score as an exact match. 

26. (Currently amended) The computer-readable medium of claim 24, further 
comprising computer-readable code, responsive to the highest score being less than a
second threshold and greater than a first threshold, adapted to present the known 
string of characters having the highest score to a user for manual confirmation. 

27. (Currently amended) The computer-readable medium of claim 24, further 
comprising computer-readable code, responsive to the highest score being less than a 
second threshold and greater than a third threshold, adapted to present the known
string of characters having the highest score to a user to select the equivalent string of
characters. 



28. (Currently amended) The computer-readable medium of claim 23, 
wherein the [[sub-strings of characters are 3-grams]] computer-readable code adapted to
form a plurality of sub-strings of characters comprises computer-readable code
adapted to successively extend sub-strings based on frequency of occurrence in the
modified query string.

29. (Currently amended) The computer-readable medium of claim 23, 
wherein the query string of characters is selected from a group consisting of a song title,
a song artist, an album name, a book title, an author's name, a book publisher, a
genetic sequence, and a computer program. 

30. (Currently amended) The computer-readable medium of claim 23, 
wherein the predetermined set of heuristics comprises removing whitespace from the 
query string [[of characters]].

31. (Currently amended) The computer-readable medium of claim 23, 
wherein the predetermined set of heuristics comprises removing a portion of the 
query string [[of characters]].

32. (Currently amended) The method of claim 23, wherein the predetermined 
set of heuristics comprises replacing a symbol in the query string of characters with 
an alternate representation for the symbol.

33. (Currently amended) The computer-readable medium of claim 23 further
comprising computer-readable code adapted to store a database entry indicating that
the query string [[updating the known string of characters to indicate the string of
characters]] is [[the]] an equivalent of the identified known string [[of characters]].

34. (Currently amended) A computer-implemented system for performing 
text equivalencing from a query string of characters comprising: 

a modifying means for modifying the query string of characters using a prede- 
termined set of heuristics; 

a comparator means for performing a character-by-character comparison of 
the modified query string with at least one known string of characters in a
corpus in order to locate a match; 

a formation means for, responsive to not finding an exact match, [[a formation
means for]] forming a plurality of sub-strings of characters from the query
string [[of characters]], the sub-strings having varying lengths; and

an information retrieval means for [[determining]] identifying a known string of
characters equivalent to the query string [[of characters]].

35. (Currently amended) The system of claim 34, wherein the information re- 
trieval means further comprises: 

a weight means for weighting the sub-strings; 

a score means for scoring the known strings of characters; and

a retrieval means for retrieving information associated with the known string
of characters [[with]] having the highest score.